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1 Introduction 

Clusters of galaxies occupy a special place in the hierarchy of cosmic struc- 
tures. They arise from the collapse of initial perturbations having a typical 
comoving scale of about 10 h~ 1 Mpc 1 . According to the standard model of cos- 
mic structure formation, the Universe is dominated by gravitational dynamics 
in the linear or weakly non-linear regime and on scales larger than this. In this 
case, the description of cosmic structure formation is relatively simple since 
gas dynamical effects are thought to play a minor role, while the dominating 
gravitational dynamics still preserves memory of initial conditions. On smaller 
scales, instead, the complex astrophysical processes, related to galaxy forma- 
tion and evolution, become relevant. Gas cooling, star formation, feedback 
from supernovae (SN) and active galactic nuclei (AGN) significantly change 
the evolution of cosmic baryons and, therefore, the observational properties 
of the structures. Since clusters of galaxies mark the transition between these 
two regimes, they have been studied for decades both as cosmological tools 
and as astrophysical laboratories. 

In this Chapter I concentrate on the role that clusters play in cosmology. 
I will highlight that, in order for them to be calibrated as cosmological tools, 
one needs to understand in detail the astrophysical processes which determine 
their observational characteristics, i.e. the properties of the cluster galaxy 
population and those of the diffuse intra-cluster medium (ICM). 

Constraints of cosmological parameters using galaxy clusters have been 
placed so far by applying a variety of methods. For example: 

1. The mass function of nearby galaxy clusters provides constraints on the 
amplitude of the power spectrum at the cluster scale (e.g., [1381116^) and 
references therein). At the same time, its evolution provides constraints 
on the linear growth rate of density perturbations, which translate into 
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dynamical constraints on the matter and Dark Energy (DE) density pa- 
rameters. 

2. The clustering properties (i.e., correlation function and power spectrum) 
of the large-scale distribution of galaxy clusters provide direct information 
on the shape and amplitude of the underlying DM distribution power spec- 
trum. Furthermore, the evolution of these clustering properties is again 
sensitive to the value of the density parameters through the linear growth 
rate of perturbations (e.g., |2"r>llll4| and references therein). 

3. The mass-to-light ratio in the optical band can be used to estimate the 
matter density parameter, J?,,,, once the mean luminosity density of the 
Universe is known and under the assumption that mass traces light with 
the same efficiency both inside and outside clusters (see GDI 123 as 
examples of the application of this method). 

4. The baryon fraction in nearby clusters provides constraints on the matter 
density parameter, once the cosmic baryon density parameter is known, 
under the assumption that clusters are fair containers of baryons (e.g., 

llfi9| L Furthermore, the baryon fraction of distant clusters provide a 
geometrical constraint on the DE content and equation of state, under the 
additional assumption that the baryon fraction within clusters does not 
evolve (e.g., 

An extensive presentation of all these methods would probably require a 
dedicated book. For this reason, in this contribution I will mostly concentrate 
on the method based on the evolution of the cluster mass function. A substan- 
tial part of my Lecture will concentrate on the different methods that have 
been applied so far to weight galaxy clusters. Since all the above cosmological 
applications rely on precise measurements of cluster masses, this part of my 
contribution will be of general relevance for cluster cosmology. 

Also, since most of the cosmological applications of galaxy clusters have 
been based so far on X-ray surveys, my discussion will be definitely X-ray 
biased, although I will discuss in some detail methods based on optical obser- 
vations and what present and future optical surveys are expected to provide. 
I refer to the Lectures by R. Gal and O. Lopez-Cruz in this volume for more 
details regarding the properties of galaxy clusters in the optical band. Also, I 
will refer to the Lectures by M. Birkinshaw for cosmological studies of clusters 
based on the Sunyaev-Zeldovich (SZ) effect, to the Lectures by J. -P. Kneib 
for cluster studies and mass measurement through gravitational lensing, and 
to the Lectures by C. Jones and by C. Sarazin for more details about the 
cosmological application of the baryon fraction method. 

The structure of this Chapter will be as follows. I provide in Section 2 a 
short introduction to the basics of cosmic structure formation. I will shortly 
review the linear theory for the evolution of density perturbations and the 
spherical collapse model. In Section 3 I will describe the Press-Schechter (PS) 
formalism to derive the cosmological mass function. I will then introduce 
extensions of the PS approach and present the most recent calibrations of the 
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mass function from N-body simulations. In Section 4 I will review the methods 
to build samples of galaxy clusters, based on optical and X-ray observations, 
while I will only briefly discuss the SZ methodology for cluster surveys. Section 
5 is devoted to the discussion of different methods to derive cluster masses 
and to review the results of the application of these methods. In Section 6 

1 will describe the cosmological constraints, which have been obtained so far 
by tracing the cluster mass function with a variety of methods: distribution 
of velocity dispersions, X-ray temperature and luminosity functions, and gas 
mass function. In this Section I will also critically discuss the reasons for 
the different, sometimes discrepant, results that have been obtained in the 
literature and I will highlight the relevance of properly including the analysis 
of the cluster mass function all the statistical and systematic uncertainties in 
the relation between mass and observables. Finally, I will describe in Section 7 
the future perspectives for cosmology with galaxy clusters and which are the 
challenges for clusters to keep playing an important role in the era of precision 
cosmology. 

2 A concise handbook of cosmic structure formation 

In this section I will briefly review the basic concepts of cosmic structure 
formation, which are relevant for the study of galaxy clusters as tools for 
precision cosmology through the evolution of their mass function. A complete 
treatment of models of structure formation can be found in classical cosmology 
textbooks (e.g., p3IIT53l l32p. 

2.1 The statistics of cosmic density fields 

Let /3(x) be the matter density field, which is a continuous function of the 
position vector x, p = ( p ) its average value computed over a sufficiently large 
(representative) volume of the Universe and 

p 

the corresponding relative density contrast. By definition, it is S = and 
(5(x) > — 1. If the density field is traced by a discrete distribution of points 
(i.e., galaxies or galaxy clusters) all having the same weight (mass), then 
p(x) = J2i$v( x ~ x 0> where 5d(x) is the Dirac delta-function. The Fourier 
representation of the density contrast is given by 

5 >) = ^/ d ^« cik - X > ( 2 ) 

with the corresponding dual relation for the inverse Fourier transform. 
The 2-point correlation function for the density contrast is defined as 
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£(r) = <*(*iW*a)>. (3) 

which only depends on the modulus of the separation vector, r — |xi — X2I, 
under the assumption of statistical isotropy of the density field. Therefore, 
it can be shown that the power spectrum of the density fluctuations is the 
Fourier transform of the correlation function, so that 

P{k) = (\5(k)\ 2 ) = ^2 / drr^(r)^, (4) 

which, again, depends only on the modulus of the wave- vector k. 

In case we are interested in the study of a class of observable structures 
of mass M, which arise from the collapse of initial perturbations having size 
R oc (M/p) 1 / 3 , then it is common to introduce the smoothed density field, 
which is defined as 

<y«(x) = <5 M (x) = / <5(y)W R (|x - y|) dy . (5) 

As such, it is given by the convolution of the density fluctuation field with a 
window function, which filters out the fluctuation modes having wavelength 
R. Eq.JSJ allows us to introduce the variance of the fluctuation field com- 
puted at the scale R, defined as 

4 = *m = (4) = ^2 / dkk 2 P(k)W 2 (k) , (6) 

where Wn(k) is the Fourier transform of the window function. 

The shape of the window function defines the exact relation between mass 
and smoothing scale. For instance, for the top-hat window it is 

^ = 3[sin(fcfl) - kRcosjkR)} 

while the Gaussian window gives 

W fl (fe)=expf-M!V. (8) 



The corresponding relations between mass scale and smoothing scale are 
M = (4n/3)R 3 p and M = (2ttR 2 ) 3 / 2 p for the top-hat and Gaussian filters, 
respectively. 

The shape of the power spectrum is (essentially) fixed once the matter 
density parameter, ]? m , that associated to the baryonic component, ^bar, and 
the Hubble parameter, Hq, are specified (e.g., |53|). However, its normalization 
can only be fixed through a comparison with observational data of the large- 
scale structure of the Universe or of the anisotropics of the Cosmic Microwave 
Background (CMB). A common way of parametrizing this normalization is 
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through the quantity er 8 , which is defined as the variance, computed for a 
top-hat window having comoving radius R — 8/i _1 Mpc (given in eq. 
The historical reason for this choice of the normalization scale is that the 
variance of the galaxy number counts, within the first redshift surveys, was 
observed to be about unity inside spheres of that radius (e.g., |1B|). In this 
way, the value of as for a given cosmology directly provides a measure of 
the biasing parameter relating the galaxy and mass distribution, expected for 
that model. Furthermore, a top-hat sphere of 8/i -1 Mpc radius contains a 
mass M ~ 5.9 x 10 14 f2 m h~ 1 M Q , which is the typical mass of a moderately 
rich galaxy cluster. Therefore, as we shall see in Section [21 the mass function 
of galaxy clusters provides a direct measure of a& ■ 



2.2 The linear evolution of density perturbations 

Let us assume that the matter content of the Universe is dominated by a pres- 
surless and self-gravitating fluid. This approximation holds if we are dealing 
with the evolution of the perturbations in the dark matter (DM) component or 
in case we are dealing with structures whose size is much larger than the typ- 
ical Jeans scale-length of baryons. Let us also define x to be the comoving co- 
ordinate and r = a(t)x the proper coordinate, a(t) being the cosmic expansion 
factor. Furthermore, if v = r is the physical velocity, then v = ax + u, where 
the first term describes the Hubble flow, while the second term, u = a(t)x, 
gives the peculiar velocity of a fluid element which moves in an expanding 
background. 

In this case the equations that regulate the Newtonian description of the 
evolution of density perturbations are the continuity equation: 

^+V.[(l + <5)u] = 0, (9) 

which gives the mass conservation, the Euler equation 

^ + 2tf(t)u+(u-V)u = -^, (10) 

which gives the relation between the acceleration of the fluid element and the 
gravitational force, and the Poisson equation 

V 2 </> = 4irGpa 2 5 (11) 

which specifies the Newtonian nature of the gravitational force. In the above 
equations, V is the gradient computed with respect to the comoving coor- 
dinate x, </>(x) describes the fluctuations of the gravitational potential and 
H(t) = a/a is the Hubble parameter at the time t. Its time-dependence is 
given by H(t) = E(t)H , where 

E(z) = [{l + zff2 m + {l + z) 2 {l-Q m -Q DE ) + (l + zf ( - 1+ ^Q DE ] 1 / 2 (12) 
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is related to the density parameter contributed by non-relativistic matter, 
fi m , and by Dark Energy (DE), Qde-, with equation of state p = wpc 2 (if the 
DE term is provided by cosmological constant then w = — 1). 

In the case of small perturbations, these equations can be linearized by 
neglecting all the terms which are of second order in the fields 5 and u. In 
this case, after further differentiating eq.© with respect to time, using the 
Euler equation to eliminate the term du/dt, and using the Poisson equation 
to eliminate V 2 </>, one ends up with: 



This equation describes the Jeans instability of a pressurless fluid, with the ad- 
ditional "Hubble drag" term 2H(t)dS /dt, which describes the counter-action 
of the expanding background on the perturbation growth. Its effect is to pre- 
vent the exponential growth of the gravitational instability taking place in a 
non-expanding background 14,. The solution of the above equation can be 
casted in the form: 



where D + and Z?_ describes the growing and decaying modes of the density 
perturbation, respectively. In the case of an Einstein-de-Sitter (EdS) Uni- 
verse {Q m = 1, f2 DE = 0), it is H(t) = 2/(3i), so that D+(t) = (t/U) 2 / 3 and 
D-(t) = (t/uy 1 . The fact that D+(t) oc a(t) for an EdS Universe should not 
be surprising. Indeed, the dynamical time-scale for the collapse of a perturba- 
tion of uniform density p is td yn oc (Gp) -1 / / 2 , while the expansion time scale 
for the EdS model is t cxp cx (Gp)" 1 ^ 2 , where p is the mean cosmic density. 
Since for a linear (small) perturbation it is p ~ p, then tdyn ~ ^cxp, thus show- 
ing that the cosmic expansion and the perturbation evolution take place at 
the same pace. This argument also leads to understanding the behaviour for 
a fl m < 1 model. In this case, the expansion time scale becomes shorter than 
the above one at the redshift at which the Universe recognizes that Q m < 1. 
This happens at 1 + z ~ flm^ or at 1 + z ~ in the presence or absence 
of a cosmological constant term, respectively. Therefore, after this redshift, 
cosmic expansion takes place at a quicker pace than gravitational instability, 
with the result of freezing the perturbation growth. 

The exact expression for the growing model of perturbations is given by 



(e.g., |124p . I show in Figure the redshift dependence of the linear growth 
factor for an Eds model and for two models with Q m = 0.3 both with and with- 
out a cosmological constant term to restore spatial flatness. Quite apparently, 
the EdS has the faster evolution, while the slowing down of the perturba- 
tion growth is more apparent for the open low-density model, the presence of 




(13) 



(J(x,t) =<J+(x,ti)D+(t) +<J_(x,ti)D_(t), 



(14) 




(15) 
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Fig. 1. The redshift dependence of the linear growth mode of perturbations for a 
flat model with fi m = 1 (solid curve), for a flat ft m — 0.3 model with a cosmo- 
logical constant (dashed curve) and for an Q m = 0.3 open model with vanishing 
cosmological constant (dotted curve). 

cosmological constant providing an intermediate degree of evolution. A more 
pictorial view is provided in Figure [3 where we show the dark matter den- 
sity fields for two different cosmologies and at different epochs, as obtained 
from N-body simulations. The two models, an EdS one and a flat low-density 
one with f2 m = 0.3, have been tuned so as to have a similar appearance at 
z = 0. This figure clearly shows that any observational probe of the degree 
of evolution of density perturbations would correspond to a sensitive probe 
of cosmological parameters. Such a cosmological test is conceptually different 
to those provided by the standard geometrical tests based on luminosity and 
angular-size distances. 

As we shall discuss in the following, clusters of galaxies provide such a 
probe, since the evolution of their number density is directly related to the 
growth rate of perturbations. 

2.3 The spherical top hat collapse 

A spherical perturbation at constant density represents the only case in which 
the evolution can be exactly computed. Although the assumptions on which 



8 Stefano Borgani 




Fig. 2. The evolution of the cluster population from N-body simulations in two 
different cosmologies 1261 . Left panels describe a flat, low-density model with fi m — 
0.3 and Qa = 0.7 (L03); right panels are for an Einstein-de-Sitter model (EdS) with 
Q m = 1. Superimposed on the dark matter distribution, the yellow circles mark 
the positions of galaxy clusters with virial temperature T > 3 keV, the size of the 
circles is proportional to temperature. Model parameters have been chosen to yield 
a comparable space density of clusters at the present time. Each snapshot is 250/i _1 
Mpc across and 75ft -1 Mpc thick (comoving with the cosmic expansion). 
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this model is based are quite restrictive, nevertheless it serves as a very use- 
ful guideline to characterize the process of evolution and formation of viri- 
alized DM halos. This approach is based on treating the perturbation as a 
separate Friedmann-Lemaitre-Robertson-Walker (FLRW) universe, with the 
constraint of null velocity at the boundary of the perturbation. Here we will 
sketch the derivation in the case of Q m = 1 (e.g., 02], while an extension of 
this derivation to more general cosmologies can be found in [^1] and [HSj , with 
useful fitting functions provided in [3*T] . 

Assuming null velocities at an initial time ij provides the relation D + (ti) = 
(3/5)<5(ii), between the linear growth mode of the perturbation and the initial 
overdensity. The initial density parameter, which characterizes this separate 
Universe, is then f2 p (ti) = + Si). Therefore, the condition for the 

perturbation to recollapse will be Q p (ti) > 1. If this condition is satisfied, 
then we can derive the density within the perturbation at the time t m of its 
maximum expansion (turn-around) as 



Pp(t,n) = Pc(ti)f2 p (ti) 



(16) 



The time t m is given by the solution of the Friedmann equations for a closed 
Universe: 

3tt 



_ 7T O p (ti) 

m ~2H i[ n p (t i )-if 2 



32G Pp (t m ) 



(17) 



where Hi is the Hubble parameter within the perturbation. At the same epoch 
t m , the density of the general cosmic background is p(t m ) — (67rG?i^J _1 . 
Therefore, the exact value for the perturbation overdensity at the turn-around 
is 

M*0-^-i-(£)'-i*«. («) 

On the other hand, the linear-theory extrapolation to t m would give 

^777, \ O / 07T 



5+(t m )=5 + (t i )[f) = 5{-J ^ L07 - (19) 

This demonstrates that the linear-theory extrapolation significantly underes- 
timates overdensities at the turn-around. 

After reaching the maximum expansion, the perturbation then evolves by 
detaching from the general Hubble expansion and then recollapses, reaching 
virial equilibrium supported by the velocity dispersion of DM particles. This 
happens at the virialization time t v ir, at which the perturbation meets by 
definition the virial condition E = K + U = —K, being E, K and U the total, 
the kinetic and the potential energy, respectively. 

At the turn-around point, the perturbation has no kinetic energy, so that 
the total energy is 
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where we have used the expression for the potential energy of a uniform spher- 
ical density field of radius R m and total mass M. In a similar manner, the 
total energy at the virialization is 

" U 13GM2 (21) 



2 2 5 R 



vir 



Therefore, the condition of energy conservation in a dissipationless collapse 
gives R m = 2i? v ; r for the relation between the radii at turn-around and at 
virial equilibrium. This allows us to compute the overdensity at t v ir as 

Pp(ivir) _ / t vil \ ( R m \ Pp(t m ) _ 2 n / 37T X " 



= 2 2 2 d — =18^- 178, (22) 

p(tvir) \t m j \r vii j P {t m ) V 4 y v ; 

where we have accounted for both the compression of the perturbation den- 
sity, due to its shrinking, and of the dilution of the background density as 
the Universe expands from t m to t vlr . Ea. (|22(l shows why an overdensity of 
about 200 is usually considered as typical for a DM halo which has reached 
the condition of virial equilibrium. As for the extrapolation of linear-theory 
prediction, it would have given 

M*vir) = f^) 3 S + (t m ) ~ 1.69. (23) 

The above equation shows the derivation of another fundamental number that 
will be used in what follows in order to characterize the mass function of viri- 
alized halos. It gives the overdensity that a perturbation in the initial density 
field must have for it to end up in a virialized structure. While the above 
derivation holds for an EdS Universe, it can be generalized to any generic 
cosmology. For Q m < 1 the increased expansion rate of the Universe causes a 
faster dilution of the cosmic density from t m to t v ; r and, as a consequence, a 
larger value of the overdensity at virialization. 

In the following, we will indicate with Z\ v ir the overdensity at virial equi- 
librium, computed with respect to the background density, and with A c the 
same quantity expressed in units of the critical density p c . As a reference, a 
flat low-density model with fl m = 0.3 has A c ~ 100 and Z\ V i r — 330. Also, 
we will use in the following the notation Rn to indicate the radius of a halo 
encompassing an average overdensity equal to Np c , so that Mjv will denote 
the halo mass contained within that radius. As we shall see in the following, 
values often used in the literature are N = 200, 500 and 2500. 



3 The mass function 



The mass function (MF) at redshift z, n(M,z), is defined as the number 
density of virialized halos found at that redshift with mass in the range 
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[M,M + dM]. In this section I will derive the MF expression following the 
approach originally devised by Press and Schechter |132| (PS hereafter). After 
commenting on the limitations of this approach, I will discuss the accuracy 
with which improved derivations of the MF reproduce the "exact" predictions 
from N-body simulations. 



3.1 The Press Schechter mass function 



The PS derivation of the MF is based on the assumption that the fraction of 
matter ending up in objects of a given mass M can be found by looking at 
the portion of the initial (Lagrangian) density field, smoothed on the mass- 
scale M, lying at an overdensity exceeding a given critical threshold value, 
5 C . Under the assumption of Gaussian perturbations, the probability for the 
linearly-evolved smoothed field 5m to exceed at redshift z the critical density 
contrast 5 C reads 

i poo / ^2 \ y 

P >Sc (M, z) = -= — / exp - M . dS u = ^erfc 



\/2W(*)4 V 2a M {z) 2 J 2 \V2a M (z) 

. (24) 

where erfc(x) is the complement error function and <Jm{z) = S+{z)&M is the 
variance at the mass scale M linearly extrapolated at redshift z. Under the 
assumption of spherical collapse, the critical overdensity 5 C is given by the 
linear extrapolation of the overdensity at virial equilibrium, as derived in the 
previous section. In this case, it will be 5 C = 5 c (z) with a weak dependence 
upon redshift and cosmological parameters, with 5 C ~ 1.69 independent of 
z only in the case of an EdS cosmology. By definition, the above equation 
provides the fraction of unity volume, which ends up by redshift z in objects 
with mass above M. Therefore, the fraction of Lagrangian volume in objects 
with mass in the range [M, M + dM] is 



dp >Sc (M, z) 



d P>Sc {M, z) 



dM 



dM . (25) 



Since the probability of ea. l|24|l is a decreasing function of mass, the absolute 
value is required in order to have a positive-defined differential probability. 
Ea. H25l) shows a fundamental limitation of the PS derivation of the MF. In- 
deed, we expect that, as we take the limit of arbitrarily small limiting mass, 
we should recover the whole mass content of the Universe. This is to say that, 
in the hierarchical clustering picture, all the mass is contained within halos 
of arbitrarily small mass. However, integrating ea. l|25|l over the whole mass 
range gives J Q dp > s c (M, z) = 1/2. This implies that the PS derivation of the 
mass function only accounts for half of the total mass at disposition. The basic 
reason for this is that, in this derivation, we give zero probability for a point 
with 5m < 5 Cl for a given filtering mass scale M, to have 5m> > 5 C for some 
larger filtering scale M' > M. This means that the PS approach neglects the 
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possibility for that point to end up in a collapsed halo of larger mass. A more 
rigorous derivation of the mass function, which is based on the excursion-set 
formalism 23 , correctly accounts for the missing factor 2, at least for the par- 
ticular choice of a sharp-A: filter (i.e., a top-hat window function in Fourier 
space). 

Since ea. 125(1 provides the fraction of volume in objects of a given mass, 
the number density of such objects will be obtained after dividing it by the 
volume, Vm = M/p, occupied by each object. Therefore, after accounting for 
the missing factor 2, the expression for the mass function reads 

dn(M, z) _ 2 dp >Sc {M,z ) 
dM ~ V M dM 
[2 p S c 
\ irM* o M {z) 

This is the expression for the PS mass function. Although we will present 
below a more accurate expressions for the MF, this equation already demon- 
strates the reason for which the mass function of galaxy clusters is a power- 
ful probe of cosmological models. Cosmological parameters enter in ea. 1(26(1 
through the mass variance om > which depends on the power spectrum and on 
the cosmological density parameters, through the linear perturbation growth 
factor, and, to a lesser degree, through the critical density contrast S c . Taking 
this expression in the limit of massive objects (i.e., rich galaxy clusters), the 
MF shape is dominated by the exponential tail. This implies that the MF be- 
comes exponentially sensitive to the choice of the cosmological parameters. In 
other words, a reliable observational determination of the MF of rich clusters 
would allow us to place tight constraints on cosmological parameters. 



tilog <J M (z) 


exp { 


_ % \ 




dlogM 


2<tm(z) 2 J 



(26) 



3.2 Extensions of the PS approach and N— body tests 

Following |89| , an alternative way of recasting the mass function is 

M dn{M, z) 

f(a M ,z) = — — — . (27) 

p a m a M 

In this way, the PS expression is recovered by setting 

2 5 r 



f{a M ,z) = \ -exp \-Trr) (28) 

V TTCTM V M J 

Despite its subtle simplicity (e.g., 1 1 2 i. the PS MF has served for more 
than a decade as a guide to constrain cosmological parameters from the mass 
distribution of galaxy clusters. Only with the advent of a new generation of 
N-body simulations, which are able to cover a very large dynamical range, 
have significant deviations of the PS expression from the exact numerical 
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description been noticed (e.g., |77l 17511551 loTfl 1 15(J1 1166] ). Such deviations have 
been usually interpreted in terms of corrections to the PS approach. 

Incorporating the effect of non-spherical collapse, the PS expression has 
been generalized |146| to 

These authors also compared this expression with results from N-body sim- 
ulations, in which the mass of the clusters were estimated with a spherical 
overdensity (SO) algorithm, by computing the mass within the radius encom- 
passing a mean overdensity equal to the virial one. As a result, they found 
the best-fitting values a = 0.707, q = 0.3, with the normalization constant 
C = 0.3222 obtained from the normalization requirement J Q f{<JM)dv = 1 
(note that the PS expression is recovered for a = 1, q = and C = 1/2; see 
also P37J). 

Jenkins et al. 89 proposed an alternative expression for the mass function: 

f(a M ,z) = 0.315exp(-|ln<T A/ 1 +0.61| 3 ' 8 ), (30) 

which has been obtained as the best fit to the results of a combination of dif- 
ferent simulations, covering a wide dynamical range. More recently, Springel 
et al. |150| used the largest available single N-body simulation to verify in de- 
tail the accuracy of ea. H30ll . The result of this comparison, which is reported 
in Figure|21 demonstrates that this mass function reproduces remarkably well 
numerical results over a wide range of sampled halo masses and redshifts, 
thereby representing a substantial improvement with respect to the PS mass 
function. The accuracy of ea. (|30|l in reproducing results of numerical experi- 
ments has been also discussed in jS^], where it is also pointed out the role of 
different algorithms to identify clusters and to estimate their mass in simula- 
tions, in |167) . where the universality of this expression for a generic cosmology 
is discussed, and in |166| , where the widest dynamical range to date has been 
samples by combining a series of N-body simulations. 

In practical applications, the observational mass function of clusters is usu- 
ally determined over about one decade in mass. Therefore, it probes the power 
spectrum over a relatively narrow dynamical range, and does not provide 
strong constraints on the shape of the power spectrum. Using only the number 
density of nearby clusters of a given mass M, one can constrain the amplitude 
of the density perturbation at the physical scale R oc (M / Q m p cr n) 1 / 3 which 
contains this mass. Since such a scale depends both on M and on f2 m , the 
mass function of nearby (z?S 0.1) clusters is only able to constrain a relation 
between <jg and f2 m - In the left panel of Figure0]we show that, for a fixed value 
of the observed cluster mass function, the implied value of as from ea. H29(l 
increases as the density parameter decreases. Determinations of the cluster 
mass function in the local Universe using a variety of samples and methods 
indicate that crg^m — 0.4 — 0.6, where a ~ 0.4 — 0.6, almost independent 
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Fig. 3. Left panel: the mass function of DM halos (dots with errorbars) identified 
at different redshifts in the Millenniun Run |15U| . compared to the predictions of the 
mass function by |89| and by |132| . The two model mass functions are plotted with 
solid and dotted curves, respectively. Right panel: the relative standard deviation of 
erg as a function of the sample size |ltj7| from the cumulative mass function above 
two different mass limits. The solid lines indicate the width of the distribution when 
including the clustering of clusters. 

of the presence of a cosmological constant term providing spatial flatness. As 
for the evolution with redshift, the growth rate of the density perturbations 
depends primarily on Q m and, to a lesser extent, on Qa, at least out to z ~ 1, 
where the evolution of the cluster population is currently studied. Therefore, 
following the evolution of the cluster space density over a large redshift base- 
line, one can break the degeneracy between og and fl m - This is shown in a 
pictorial way in Figure [21 and quantified in the right panel of Figure mod- 
els with different values of i? m , which are normalized to yield a comparable 
number density of nearby clusters, predict cumulative mass functions that 
progressively differ by up to orders of magnitude at increasing redshifts. 

Although ca. (|3L)[) provides a very accurate and flexible tool to constrain 
the parameter space of cosmological models using the mass function of col- 
lapsed halos, nevertheless a further source of uncertainty may arise from the 
effect of cosmic variance. Fluctuations modes, with wavelength exceeding the 
size of the volumes sampled by observations, induces appreciable changes in 
the number counts of halos of a given mass. This effect has been thoroughly 
discussed in [571 HbT| . In the right panel Figure (from |167j 1 I report the 
relative variation of the power spectrum normalization, erg, induced by cos- 
mic variance, as a function of the sample size, for halos having two different 
mass limits. As expected, the variance decreases with the sample size (fluctu- 
ations on larger scales have a smaller effect), while it increases with the halo 
mass (the distribution of rarer objects suffer for a more pronounced large- 
scale modulation). This result demonstrates that a precision calibration of 
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cosmological parameters requires properly accounting for the effect of cosmic 
variance. 
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Fig. 4. The sensitivity of the cluster mass function to cosmological models |138| . 
Left panel: The cumulative mass function at z — for M > 5 x 10 14 /i~ x Mq for three 
cosmologies, as a function of as, with shape parameter r — 0.2; solid line: Q m — 1; 
short-dashed line: Q m = 0.3, Qa = 0.7; long-dashed line: fi m = 0.3, Qa = 0. The 
shaded area indicates the observational uncertainty in the determination of the local 
cluster space density. Right panel: Evolution of n(>M,z) for the same cosmologies 
and the same mass-limit, with as ~ 0.5 for the J7 m = 1 case and as = 0.8 for the 
low-density models. 



4 Building a cluster sample 

4.1 Identification in the optical / near IR band 

Abell [U provided the first extensive, statistically complete sample of galaxy 
clusters, later extended to the Southern hemisphere Based on purely vi- 
sual inspection, clusters were identified as enhancements in the galaxy surface 
density and were characterized by their richness and estimated distance. The 
Abell catalog has been for decades the prime source for detailed studies of in- 
dividual clusters and for characterizing the large scale distribution of matter 
in the nearby Universe. Several variations of the Abell criteria defining clus- 
ters were used in an automated and objective fashion when digitized optical 
plates became available (e.g., [11)21 145j ). Deep optical plates were used suc- 
cessfully to search for more distant clusters, out to z ~ 0.9, with purely visual 
techniques (e.g., [751 These searches for distant clusters became much 
more effective with the advent of CCD imaging. Postman et al. were 
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the first to carry out a V&I-band survey over 5 deg 2 (the Palomar Distant 
Cluster Survey, PDCS) . This technique enhances the contrast of galaxy over- 
density at a given position, utilizing prior knowledge of the luminosity profile 
typical of galaxy clusters. Dalcanton 44 proposed another method of optical 
selection of clusters, in which drift scan imaging data from relatively small 
telescopes is used to detect clusters as positive surface brightness fluctuations 
in the background sky. Gonzalez et al. |75| applied a technique based on sur- 
face brightness fluctuations from drift scan imaging data to build a sample of 
~1000 cluster candidates over 130 deg 2 . 

A common feature of all these methods of cluster identification is that they 
classify clusters according to definitions of richness, which generally have a 
loose relation with the actual cluster mass. This represents a serious limitation 
for any cosmological application, which requires the observable, on which the 
cluster selection is based, to be a reliable proxy of the cluster mass. 

An improved definition of richness, based on the amplitude of the galaxy- 
cluster cross-correlation function, has been applied [H| to clusters identified in 
a large area survey in R and z bands (the Red Sequence Cluster Survey). This 
survey, whose optical and X-ray follow-up, is currently underway, promises 
to unveil a fairly large number of clusters out to z~ 1.5. 

By increasing the number of observed passbands and using red colors one 
can increase the contrast with which clusters are seen in color space. In this 
way, one can increase the efficiency of cluster selection also at high redshift 
(e.g., |151l 1711 1152] ) and the accuracy of their estimated redshifts through 
spectro-photometric techniques. In this way, Miller et al. [Ill] designed a 
cluster finding algorithm which makes full use of information of both posi- 
tion and color space to detect clusters of galaxies from the SDSS. They were 
able to identify about 750 clusters out to z^, 0.2, and assessed the degree of 
completeness by resorting to a comparison with mock SDSS surveys extracted 
from large N-body simulations. Once completed, the search of clusters over 
the entire SDSS sample will provide about 2500 nearby and medium-distant 
objects. At the same time the next generation of wide field (> 100 deg 2 ) 
deep multicolor surveys in the optical and especially the near-infrared will 
powerfully enhance the search for distant clusters, out to 1. 

4.2 Identification in the X-ray band 

Already from the first pioneering attempts to map the X-ray sky ( 66 , 
see |138| for a historical review), clusters were associated with extended 
sources, whose dominant emission mechanism was recognized to be thermal 
bremsstrahlung from optically thin plasma at a temperature of several keV 
j^EHj- The all-sky survey conducted by the the HEAO-1 X-ray Observatory 
was the first to provide a flux-limited sample of X-ray identified clusters, for 
which both the flux number counts and the X-ray luminosity function have 
been computed for the first time |126j . However, it is only thanks to the much 
improved sensitivity of the Einstein Observatory |65] that X-ray surveys were 
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recognized as an efficient means of constructing samples of galaxy clusters out 
to cosmologically interesting redshifts. 

First, the X-ray selection has the advantage of revealing physically-bound 
systems, because diffuse emission from a hot ICM is the direct manifestation 
of the existence of a potential-well within which the gas is in dynamical equi- 
librium with the cool baryonic matter (galaxies) and the dark matter. Second, 
the X-ray luminosity is well correlated with the cluster mass (see Figure II 111 . 
Third, the X-ray emissivity is proportional to the square of the gas density, 
hence cluster emission is more concentrated than the optical bidimensional 
galaxy distribution. In combination with the relatively low surface density 
of X-ray sources, this property makes clusters high contrast objects in the 
X-ray sky, and alleviates problems due to projection effects that affect opti- 
cal selection. Finally, an inherent fundamental advantage of X-ray selection is 
the ability to define flux-limited samples with well-understood selection func- 
tions. This leads to a simple evaluation of the survey volume and therefore 
to a straightforward computation of space densities. Nonetheless, there are 
some important caveats described below. Pioneering work in this field |67II84| 
was based on the Einstein Observatory Extended Medium Sensitivity Survey 
(EMSS). The EMSS survey covered over 700 square degrees and lead to the 
construction of a flux-limited sample of 93 clusters out to z = 0.58, allowing 
the cosmological evolution of clusters to be investigated. 

The ROSAT satellite, launched in 1990, allowed a significant step for- 
ward in X-ray surveys of clusters. The ROSAT All-Sky Survey (RASS, H2J) 
was the first X-ray imaging mission to cover the entire sky, thus paving the 
way to large contiguous-area surveys of X-ray selected nearby clusters. In the 
northern hemisphere, the largest compilations with virtually complete optical 
identification include, the Bright Cluster Sample (BCS, |£B), and the North- 
ern ROSAT KM Sky Survey (NORAS, [22]). In the southern hemisphere, the 
ROSAT-ESO flux limited X-ray (REFLEX) cluster survey [21] has completed 
the identification of 452 clusters, the largest, homogeneous compilation to 
date. The Massive Cluster Survey (MACS, |52]) is aimed at targeting the 
most luminous systems at z > 0.3 which can be identified in the RASS at the 
faintest flux levels. The deepest area in the RASS, the North Ecliptic Pole 
(NEP, [HE]) which R OSAT scanned repeatedly during its All-Sky survey, was 
used to carry out a complete optical identification of X-ray sources over a 81 
deg 2 region. This study yielded 64 clusters out to redshift z = 0.81. 

In total, surveys covering more than 10 4 deg 2 have yielded over 1000 clus- 
ters, out to redshift z ~ 0.5. A large fraction of these are new discoveries, 
whereas approximately one third are identified as clusters in the Abell or 
Zwicky catalogs. For the homogeneity of their selection and the high degree 
of completeness of their spectroscopic identifications, these samples are now 
the basis for a large number of follow-up investigations and cosmological stud- 
ies. 

Besides the all-sky surveys, the ROSAT-PSPC archival pointed obser- 
vations were intensively used for serendipitous searches of distant clusters. 
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These projects, which are now completed, include: the RIXOS survey 
the ROSATDeep Cluster Survey (RDCS, [THIES]), the Serendipitous High- 
Redshift Archival ROSAT Cluster survey (SHARC, [32], the Wide Angle 
ROSAT Pointed X-ray Survey of clusters (WARPS, |T35]1. the 160 deg 2 large 
area survey [U3, the ROSAT Optical X-ray Survey (ROXS, ESj). ROSAT- 
HRI pointed observations have also been used to search for distant clusters in 
the Brera Multi-scale Wavelet catalog (BMW, [TTSj)- 
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Fig. 5. Solid angles and flux limits of X-ray cluster surveys carried out over the last 
two decades. Dark filled circles represent serendipitous surveys constructed from a 
collection of pointed observations. Light shaded circles represent surveys covering 
contiguous areas. The hatched region is a predicted locus of current serendipitous 
surveys with Chandra and Newton-XMM. From |138| . 



In Figure El we give an overview of the flux limits and surveyed areas of 
all major cluster surveys carried out over the last two decades. RASS-based 
surveys have the advantage of covering contiguous regions of the sky so that 
the clustering properties of clusters (e.g., \W6\ ) can be investigated. They 
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also have the ability to unveil rare, massive systems albeit over a limited 
redshift and X-ray luminosity range. Serendipitous surveys which are at least 
a factor of ten deeper but cover only a few hundreds square degrees, provide 
complementary information on lower luminosities, more common systems and 
are well suited for studying cluster evolution on a larger redshift baseline. 

A number of systematic studies have been carried out to compare the 
nature of clusters identified with the optical and the X-ray technique (e.g., 
^HlElEHj)- The general conclusion of these studies is that optically selected 
clusters are on average underluminous in the X-ray band. This suggests that 
optical selection tends to pick up objects which have not yet reached a high 
enough density to make the ICM lighting up in X-rays. 

In order for a survey to be used for cosmological applications, one needs 
to know not only how many clusters it contains, but also the volume within 
which each of them is found. In other words, one needs to define the selec- 
tion function of the survey, which depends on the survey strategy and on 
the details of the adopted cluster finding algorithm (see |138j . for a review). 
An essential ingredient for the evaluation of the selection function of X-ray 
surveys is the computation of the sky coverage: the effective area covered by 
the survey as a function of flux. In general, the exposure time, as well as the 
background and the PSF are not uniform across the field of view of X-ray 
telescopes, which introduces vignetting and a degradation of the PSF at in- 
creasing off-axis angles. As a result, the sensitivity to source detection varies 
significantly across the survey area so that only bright sources can be detected 
over the entire solid angle of the survey, whereas at faint fluxes the effective 
area decreases. An example of survey sky coverage is given in the left panel 
of Figure [H] By covering different solid angles at varying fluxes, these surveys 
probe different volumes at increasing redshift and therefore different ranges 
in X-ray luminosities at varying redshifts. 

Once the survey flux-limit and the sky coverage are defined one can com- 
pute the maximum search volume, V maxi within which a cluster of a given 
luminosity is found in that survey: 



Here S(f) is the survey sky coverage, which depends on the flux / = L/(4ndi i ), 
<Il(z) is the luminosity distance, and H(z) is the Hubble constant at z. We 
define z rnax as the maximum redshift out to which the flux of an object of 
luminosity L lies above the flux limit. The corresponding survey volumes are 
shown in the right panel of Figure El 

Once again, I emphasize that one of the main advantages of the X-ray 
selection lies in the fact that the survey selection function can be precisely 
computed, thus allowing reliable comparisons between the observed and the 
predicted evolution of the cluster population. 
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Fig. 6. Left panel: sky coverage as a function of X-ray flux of several serendipitous 
surveys. Right panel: corresponding search volumes, V(> z), for a cluster of given 
X-ray luminosity (L x = 3 x 10 44 [0.5 - 2keV] ~ LJ). From [T55) . 



4.3 Identification through the SZ effect 

The Sunyaev-Zeldovich (SZ) effect |155j allows to observe galaxy clusters 
by measuring the distortion of the CMB spectrum owing to the hot ICM. 
This method does not depend on redshift and provides in principle a reliable 
estimate of cluster masses. For these reasons, it is now considered as one of 
the most powerful means to find distant clusters in the years to come. For a 
detailed discussion of the SZ technique for cluster identification and for the 
ongoing and future surveys, I refer to the lectures by Mark Birkinshaw and to 
the reviews in |15II37| . For the purpose of the present discussion, I show in the 
left panel of Figure a comparison between the limiting mass as a function of 
redshift, expected for a X-ray and for a SZ cluster survey (from (HD))- While 
the standard flux dimming with the luminosity distance, fx oc d 2 L (z), causes 
the limiting mass to quickly increase with distance for the X-ray selection, 
this limiting mass has a much less sensitive dependence on redshift for the 
SZ selection. This is the reason why SZ surveys are generally considered as 
essentially providing mass-limited cluster samples. 

It has been recently pointed out |115| that the integrated SZ flux- 
decrement has a very tight correlation with the total cluster mass (see also 
|47|h This fact, joined with the redshift-independence of the SZ selection, 
makes the SZ identification a promising route toward precision cosmology 
with galaxy clusters. 

A potential problem with the SZ identification of clusters resides in the 
possible contamination of the signal from foreground/background structures. 
Diffuse gas, residing in large-scale filaments, are likely to provide a negligible 
contamination, as a consequence of the comparatively low density and tem- 
perature which characterize such structures. However, small halos, which are 
expected to be present in large number, contain gas at the virial overdensity. 
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Since they are not resolved in current SZ observations, their integrated con- 
tribution may provide a significant contamination. Using cosmological hydro- 
dynamical simulations, White et al. 168 have created SZ sky maps with the 
aim of correlating the SZ signal seen in projection with the actual mass of 
clusters. The result of this test is shown in the right panel of Fig. The upper 
panel shows the relation between the integrated SZ signal contributed only 
from the gas within 0.5i?2oo and M200, while the lower panel is when using 
the actual Compton-j/ parameter measured from the projected maps. Quite 
apparently, the scatter in the relation is significantly increased in projection. 
Part of the scatter is due to the different redshifts at which clusters seen in 
projection are placed. This contribution to the scatter can be removed once 
redshifts of clusters are known from follow-up optical observations. However, 
a significant contribution to the overall scatter is contributed by cluster as- 
phericity and by contamination from fore/background structures. This high- 
lights the relevance of keeping this scatter under control for a full exploitation 
of the SZ signal as a tracer of the cluster mass. 
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Fig. 7. Left panel: limiting cluster virial mass for detection in an X-ray and in 
a SZ survey (from \iS0\). Each pair of curves show the results for two Q m = 0.3 
cosmologies, having w = — 1 and w = —0.5 for the DE equation of state. Right 
panel: the relation between the Comptonization parameter and M200, from |168| . 
The upper panel shows the decrement contributed from the gas within 0.5i?2oo- The 
lower panel indicates the signal from noise-free maps projected on the light cone. 
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5 Methods to estimate cluster masses 
5.1 The hydrostatic equilibrium 

The condition of hydrostatic equilibrium determines the balance between the 
pressure force and the gravitational force: VP gas = — PgasV^, where P gas and 
p gas are the gas pressure and density, respectively, while <j> is the underlying 
gravitational potential. Under the assumption of a spherically symmetric gas 
distribution, the above equations read: 

dP gas _ d±__ GM(< r) 

dr ~ Pgas dr ~ Pgas r 2 ' {6 ' 

where r is the radial coordinate (clustercentric distance) and M(< r) is the 
total mass contained within r. Using the equation of state of ideal gas to relate 
pressure to gas density and temperature, the mass is then given by 

M«r)=-^(^ + ^n, (33 ) 
G Lim p \ dmr d In r J 

where /i is the mean molecular weight of the gas (fx ~ 0.59 for primordial 
composition) and m p is the proton mass. An often used mass estimator is 
based on assuming the /3-model for the gas density profile, 

[1 + (r/r c )2} 30/2 

|39| . In the above equation, r c is the core radius, while /3 is the ratio between 
the kinetic energy of any tracer of the gravitational potential (e.g. galaxies) 
and the thermal energy of the gas, (3 = nm p <T%/(kBT) (cr v : one-dimensional 
velocity dispersion). By further assuming a polytropic equation of state, p gas cx 
-Pgas (7 : polytropic index), ea. l|3"3"jl becomes 

M« r) c 1.11 x l°"^ ,_,' Mpcl ^ /.-M e , (35) 

where T(r) is the temperature at the radius r. In its original derivation, the 
/3-model was aimed at representing the distribution of isothermal gas sitting 
in hydrostatic equilibrium within a King-like potential. The corresponding 
mass estimator is recovered from ea- H35[) by setting 7=1 and replacing T(r) 
with the global ICM temperature, To. In the absence of accurately resolved 
temperature profiles from X-ray observations, eq. has been used to esti- 
mate cluster masses both in its isothermal (e.g., |13fl| ) and in its polytropic 

form (e.g., H2DIE2I ES|)- 

Thanks to the much improved sensitivity of the Chandra and XMM- 
Newton X-ray observatories, temperature profiles are now resolved with high 
enough accuracy to allow the application of more general methods of mass 
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Fig. 8. The mass-temperature relation for nearby clusters (from [S]) and for distant 
clusters (from 95 ), based on a combination of Chandra and XMM-Newton data. 

estimation, not necessarily bound to the assumptions of /3-model and of an 
overall polytropic form for the equation of state (e.g., [q1 I56| I51 |161| ). 

An alternative way of recasting the isothermal version of eq. I|35|l between 
temperature and mass is based on expressing the mass according to the virial 
theorem as M V - 1T = a^R v i T /G, so that 



This expression, originally introduced in |54|. has been sometimes used to 
express the M-T relation as obtained from hydrodynamical simulations of 
galaxy clusters (e.g., [8*Tll2^] ). 

It is clear that the two crucial assumptions underlying any mass measure- 
ments based on the ICM temperature concerns the existence of hydrostatic 
equilibrium and of spherical symmetry. While effects of non-spherical geome- 
try can be averaged out by performing the analysis over a large enough number 
of clusters, the former can lead to systematic biases in the mass estimates (e.g., 
133 ) and references therein). So far, ICM temperature measurements have 
been based on fits of the observed X-ray spectra of clusters to plasma models, 
which are dominated at high temperatures by thermal bremsstrahlung. How- 
ever, local deviations from isothermality, e.g. due to the presence of merging 
cold gas clumps, can bias the spectroscopic temperature with respect to the 
actual electron temperature (e.g., |l()8llll()|llti()| ). This bias directly translates 
into a comparable bias in the mass estimate through hydrostatic equilibrium 
(see Section [7| below). 
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5.2 The dynamics of member galaxies 

From a historical point of view, the dynamics traced by member galaxies, has 
been the first method applied to measure masses of galaxy clusters |148l I172| . 
Under the assumption of virial equilibrium, the mass of the cluster can be 
estimated by knowing position and redshift for a high enough number of 
member galaxies: 



(e.g., HSl), where the first factor accounts for the geometry of projection, 
ay is the line-of-sight velocity dispersion and Ry is the virialization radius, 
which depends on the positions of the galaxies with measured redshifts and 
recognized as true cluster members: 



where N is the total number of galaxies, and nj the projected separation 
between the i-th and j-th galaxies. This method has been extensively applied 
to measure masses for statistical samples of both nearby (e.g., [1711711 [TBI 11371 
11301 ) and distant (e.g., |2S||73]) clusters. 

Besides the assumption of virial equilibrium, which may be fulfilled to dif- 
ferent degrees by different populations of galaxies (e.g., late vs. early type), 
a crucial aspect in the application of the dynamical mass estimator concerns 
the rejection of interlopers, i.e. of back/foreground galaxies which lie along 
the line-of-sight of the cluster without belonging to it. A spurious inclusion 
of non-member galaxies in the analysis leads in general to an overestimate of 
the velocity dispersion and, therefore, of the resulting mass. A number of al- 
gorithms have been developed for interlopers rejection, whose reliability must 
be judged on a case-by-case basis (e.g., 1157] V A further potential prob- 
lem of this analysis concerns the possibility of realizing a uniform sampling 
of the cluster potential using galaxies with measured redshifts. For instance, 
the technical difficulty of packing slits or fibers in optical spectroscopic obser- 
vations may lead to an undersampling of the cluster central regions. In turn, 
this leads to an overestimate of Ry and, again, of the collapsed mass. 

Tests of the accuracy of mess estimates based on the dynamical virial 
method have been performed by using hydrodynamical simulations of galaxy 
clusters, in which galaxies are identified from gas cooling and star formation 
(|E3E|)- For instance, ^B] have shown that galaxies identified in the simu- 
lations are fair tracers of the underlying dynamics, with no systematic bias in 
the estimate of cluster masses, although a rather large scatter between true 
and recovered masses is induced mostly by projection effects. 

Quite reassuringly, despite all the assumptions and possible systematics 
affecting both dynamical optical and X-ray mass estimates, these two methods 
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Fig. 9. The relation between dynamical optical masses and masses derived from 
the X-ray temperature by assuming hydrostatic equilibrium (from |71|. left panel, 
and from |13(J| . right panel, based on SDSS spectroscopic data). 

provide in general fairly consistent results for both nearby (e.g., |711 1130| ) 
and distant (e.g., |9fi| ) clusters. Two examples of such comparisons are shown 
in Figure El In the left panel, we report the comparison between X-ray and 
optical dynamical masses |71) . This plot shows a reasonable agreement among 
the two mass estimates, although with some scatter. The right panel reports 
the comparison presented in |130j . In this plot, the triangles indicates the 
cluster with clear evidences of complex dynamics. Quite interestingly, the 
agreement between the two mass estimates is acceptable, with a few outliers 
which are generally identified with non-relaxed clusters. 

5.3 The self similar scaling 

The simplest model to explain the physics of the ICM is based on the assump- 
tion that gravity only determines the thermodynamical properties of the hot 
diffuse gas (HDj- Since gravity does not have a preferred scale, we expect clus- 
ters of different sizes to be the scaled version of each other as long as gravity 
only determines the ICM evolution and there are no preferred scales in the 
underlying cosmological model. This is the reason why the ICM model based 
on the effect of gravity only is said to be self-similar. 

If we define Ma c as the mass contained within the radius Ra c , encompass- 
ing a mean density A c times the critical density, then Ma c oc p c {z)A c R^ . 
Here p c (z) is the critical density of the universe which scales with redshift as 
p c {z) — Pc,oE 2 (z), where E{z) is given by ea. l|12l) . On the other hand, the 
cluster size R scales with z and Ma c as R oc M 1 ' 3 E^ 2 ^ 3 (z). Therefore, as- 
suming hydrostatic equilibrium, the cluster mass scales with the temperature 
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M Ac cx T 3 / 2 E- 1 {z). (39) 



If pgas is the gas density, the corresponding X-ray luminosity for pure thermal 
bremsstrahlung emission is 

L x = I (—) A(T)dV, (40) 

where A(T) cx T 1 / 2 . Further assuming that the gas distribution traces the 
dark matter distribution, p gas (r) cx Pdm(t), then 

Lx cx M Ac p c T 1/2 cx T 2 E(z) . (41) 
As for the CMB intensity decrement due to the thermal SZ effect we have 

AS cx j y{6)dQ cx d A 2 j ' Tn e d 3 r (x d A 2 T 5/2 E' 1 {z) , (42) 

where y is the Comptonization parameter, cLa is the angular size distance and 
n e is the electron number density. We can also write AS in a different way to 
get the explicit dependence on y$: 

AS cx yod^ 2 / dH (xy d A 2 M 2/3 E- i/3 (z) cx y d A 2 TE- 2 (z) . (43) 



In this way, we obtain the following scalings for the central value of the Comp- 
tonization parameter: 

y cx T 3 / 2 E(z) cx L% A E 1 ^{z). (44) 

Eas. H39|) . H41JI and l|44|l are unique predictions for the scaling relations 
among ICM physical quantities and, in principle, they provide a way to relate 
the cluster masses to observables at different redshifts. As we shall discuss in 
the following, deviations with respect to these relations witness the presence of 
more complex physical processes, beyond gravitational dynamics only, which 
affect the thermodynamical properties of the diffuse baryons and, therefore, 
the relation between observables and cluster masses. 



5.4 Phenomenological scaling relations 
Using the X-ray luminosity 

The relation between X-ray luminosity and temperature of nearby clusters 
is considered as one of the most robust observational facts against the self- 
similar model of the ICM. A number of observational determinations now 
exist, pointing toward a relation Lx cx T a , with a ~ 2.5-3 (e.g., [lTlp . pos- 
sibly flattening towards the self-similar scaling only for the very hot systems 
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with 10 keV [3]. While in general the scatter around the best-fitting re- 
lation is non negligible, it has been shown to be significantly reduced after 
excising the contribution to the luminosity from the cluster cooling regions 
jl(J6) or by removing from the sample clusters with evidence of cooling flows 
Q. As for the behaviour of this relation at the scale of groups, T^S 1 keV, the 
emerging picture now is that it lies on the extension of the Lx~T relation of 
clusters, with no evidence for a steepening although with a significant 

increase of the scatter |121) . possibly caused by a larger diversity of the groups 
population when compared to the cluster population. This result is reported 
in the left panel of Figure ITUl (from |121p . which shows the Lx~T relation for 
a set of clusters with measured ASCA temperatures and for a set of groups. 




Fig. 10. Left panel: the Lx~T relation for nearby clusters and groups (from |121p . 
The star symbols are the for the sample of clusters, with temperature measured from 
ASCA, while the filled squares and open circles are for a sample of groups, also with 
ASCA temperatures. Right panel: the evolution of the Lx~T relation, normalized 
to the local relation (from |l()9p . using Chandra temperatures of clusters at z > 0.4. 



As for the evolution of the Lx~T relation, a number of analyses have been 
performed, using Chandra |HS1 HE2I EZI IH2SI and XMM-Newton [2311111] data. 
Although some differences exist between the results obtained from different 
authors, such differences are most likely due to the convention adopted for 
the radii within which luminosity and temperature are estimated. In general, 
the emerging picture is that clusters at high redshift are relatively brighter, 
at fixed temperature. The resulting evolution for a cosmology with Q m = 0.3 
and Qa — 0.7 is consistent with the predictions of the self-similar scaling, 
although the slope of the high-z Lx~T relation is steeper than predicted by 
self-similar scaling, in keeping with results for nearby clusters. The left panel 
of Fig.HTHshows the evolution of the Lx~T relation from where Chandra 
and XMM-Newton observations of 11 clusters with redshift 0.6 < z < 1.0 
were analyzed. The vertical axis reports the quantity Lx/T B , where B is the 
slope of the local relation. Quite apparently, distant clusters are systematically 
brighter relatively to the local ones. However, the uncertainties are still large 
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enough not to allow the determination of a precise redshift dependence of the 
Lx~T normalization. 

As for the relation between X-ray luminosity and mass, its first calibration 
has been presented in |136j . for a sample of bright clusters extracted from 
the ROSAT All Sky Survey (RASS). In their analysis, these authors derived 
masses by using temperatures derived from ASCA observations and applying 
the equation of hydrostatic equilibrium, ea. (jri^jl . for an isothermal /3-model. 
The resulting M-Lx relation is shown in Figure El From the one hand, this 
relation demonstrates that a well defined relation between X-ray luminosity 
and mass indeed exist, although with some scatter, thus confirming that Lx 
can indeed be used as a proxy of the cluster mass. From the other hand, the 
slope of the relation is found to be steeper than the self-similar scaling, thus 
consistent with the observed Lx~T relation. 




Fig. 11. The Lx~M relation for nearby clusters (from |136| h X-ray luminosities are 
from the RASS, while masses are estimated using ASCA temperatures and assuming 
hydrostatic equilibrium for isothermal gas. 



Using the optical luminosity 

The classical definition of optical richness of clusters is known to be a poor 
tracer of the cluster mass (e.g., USD- However, the increasing quality of pho- 
tometric data for the cluster galaxy population and the ever improving ca- 
pability of removing fore/background galaxies thanks to larger spectroscopic 
galaxy samples have recently allowed different authors to demonstrate the 
optical/near-IR luminosities to be as reliable tracers of the cluster mass as 
the X-ray luminosity. 
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Two examples of recent calibrations between optical/near-IR luminosity 
and mass are shown in Figure ^] In the left panel we report the result pre- 
sented in |1(J(J| . based on K-band luminosites from the 2MASS and masses 
obtained by applying the M-T relation by Although the data points 
are rather scattered, they define a clear correlation. Quite interestingly, the 
best-fitting relation has a slope shallower than unity, thus indicating the K- 
band mass-to-light ratio is a (slightly) increasing function of the cluster mass. 
This result is in line with previous results using optical luminosities (e.g., |72] ) 
who found an increasing M/L when passing from galaxy groups to clusters of 
increasing richness. 
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Fig. 12. The relation between cluster masses and optical/near-IR luminosities. Left 
panel: M500-I/500 relation from |100| . using K-band data from the 2MASS survey 
and masses from X-ray data. The solid line is the best-fit power law, while the 
dashed line marks the unity slope. Right panel: the same relation, but in the iSloan 
band. Open and filled points corresponds to mass estimates based on the SDSS 
spectroscopic data and on the M-Tx relation, respectively. 



Popesso et al. |13U| analysed SDSS data for a set of clusters which have 
been identified in the RASS. Their mass estimates come from both X-ray 
temperature |136| and from the velocity dispersions as estimated from the 
SDSS spectroscopic data. The results of their analysis for the i band are shown 
in the right panel of Fig. El Again, the optical luminosity correlates quite 
tightly with the cluster mass, with an intrinsic scatter which is comparable 
to, or even smaller than that of the correlation between X-ray luminosity and 
mass. 

These results highlight how cluster samples with precisely measured optical 
luminosities can in principle be usefully employed to constrain cosmological 
parameters. However, while X-ray luminosity provides at the same time a 
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tracer of cluster mass and a criterion to precisely determine the sample selec- 
tion function, the latter quantity can be extracted from an optically selected 
sample only in a rather indirect way. 

6 Constraints on cosmological parameters 

In this section we will review critically results on cosmological constraints 
derived from different ways of tracing the cosmological mass function of galaxy 
clusters. 

6.1 The distribution of velocity dispersions 

A first determination of the mass function from velocity dispersions, a v , of 
member galaxies has been attempted in ^7]. Girardi et al. [SH] used a much 
larger sample of nearby clusters with measured velocity dispersions to compare 
the resulting mass function with predictions from cosmological models. The 
resulting relation between ag, and fi m was such that as — 1 for a fiducial value 
of the density parameter fl m = 0.3. More recently, data for nearby clusters, 
identified in the SDSS, have been used to calibrate a relation between richness 
and velocity dispersion JOj- They compared the resulting cr„-distribution to 
the prediction of cosmological models and found a significantly lower normal- 
ization of the power spectrum, eg — 0.7 for Q m = 0.3. Such differences from 
different analyses highlight the presence of systematic uncertainties in the 
relation between mass and observables (i.e., velocity dispersion and richness). 

The application of this method to distant clusters has been applied so far 
only to the CNOC sample 34 , which comprises 17 clusters selected from the 
EMSS out to z ~ 0.6. Still to date, this is the only sample of distant clusters, 
with calibrated selection function, for which velocity dispersions have been 
reliably measured. Bahcall et al. pointed out that the resulting evolution 
of the mass function is consistent with a low-density Universe. Borgani et 
al. [23 reanalysed this same sample and emphasised that the uncertainties in 
the local normalization of the mass function are large enough to make any 
constraints on fl m not significant. 

6.2 The temperature function 

The X-ray Temperature Function (XTF) is defined as the number density of 
clusters with given temperature, n(T). As long as a one-to-one relation exist 
between temperature and mass, the XTF can be related to the mass function, 
n(M), by the relation 



In this equation, the ratio dM/dT is provided by the relation between ICM 
temperature and cluster mass. 



n(T) 




(45) 
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Measurements of cluster temperatures for flux-limited samples of nearby 
clusters were first presented in [53]. These results have been subsequently 
refined and extended to larger samples with the advent of ROSAT, Beppo- 
.S^Xand, especially, ASCA. XTFs have been computed for both nearby (e.g., 
dnHinillilUHHI) and distant (e.g., [SSI E01 EH E21) clusters, and used to 
constrain cosmological models. The starting point in the computation of the 
XTF is inevitably a flux-limited sample for which the searching volume of 
each cluster can be computed. Then the Lx — Tx relation and its scatter is 
used to derive a temperature limit from the sample flux limit. 

Once the XTF is measured from observations, eq. l|45[) is used to infer 
the mass function and, therefore, to constrain cosmological models. A slightly 
different but conceptually identical approach, has been followed in |13fi| . where 
masses for a flux-limited sample of nearby bright RASS clusters have been 
computed by applying the assumption of hydrostatic equilibrium, thereby 
expressing their results directly in terms of mass function, rather than of 
XTF. 

Oukbir and Blanchard |122| first suggested to use the evolution of the XTF 
as a way to constrain the value of fl m . Several independent analyses converge 
now towards a mild evolution of the XTF, which is interpreted as a case for 
a low-density Universe, with 0.2^$ f2 m ^ 0.6. An example is reported in the 
right panel of Figure IT51 ( from |82p. which shows the comparison between the 
XTFs of the sample of nearby clusters [HSj and a sample of EMSS clusters 
with ASCA temperatures. 

A limitation of the XTFs presented so far is the limited sample size (with 
only a few 0.5 measurements), as well as the lack of a homogeneous sample 
selection for local and distant clusters. By combining samples with different 
selection criteria one runs the risk of altering the inferred evolutionary pattern 
of the cluster population. This can even give results consistent with a critical- 
density Universe |IT1 IT551 IT5| . 

Besides the determination of the matter density parameter, the observa- 
tional determination of the XTF also allows one to measure the normalization 
of the power spectrum, as- Assuming a fiducial value of Q m — 0.3, different 
(sometimes discrepant) determinations of as have been reported by differ- 
ent authors, ranging from as — 0.7-0.8 (e.g., |5*B1 11361 to as — 1 (e.g., 
|10fil 1128] ). Ikebe et al. [HHj compared different observational determinations 
of the XTF for nearby clusters (see left panel of Fig. I13|) and established that 
they all agree with each other reasonably well. Although quite comfortable, 
this result highlights that the discrepant results on the normalization of the 
as-f^m relation comes from the cosmological interpretation of the observed 
XTF, and not from observational uncertainties in its calibration. While the 
different model mass functions (i.e., whether Press-Schechter, Jenkins et al. 
or Shcth-Tormen) can in some cases account for part of the difference, more 
in general the different results are interpreted in terms of the different normal- 
ization of the M-T relation to be used in eq. (|45|l or to the way in which the 
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Fig. 13. Left panel: a comparison between the XTF for nearby clusters from |88| 
(shaded area), |1U6I (solid line)and J81J (dotted line). Right panel: the evolution of 
the XTF from |82| . Open and filled circles are for the local and the distant cluster 
sample, respectively. 



intrinsic scatter and the statistical uncertainties in this relation are included 
in the analysis. We shall critically discuss these issues in Section l6~51 below. 

Substantially improved observational determinations of the XTF, and cor- 
respondingly tighter cosmological constraints, are expected to emerge with the 
accumulations of data on the ICM temperature from the Chandra and XMM- 
Newton satellites. Thanks to the much improved sensitivity of these X-ray 
telescopes with respect to ASCA, temperature gradients can be measured for 
fairly large sets of nearby and medium-distant (z^> 0.4) clusters, thus allow- 
ing more precise determinations of cluster masses. At the same time, reliable 
measurements of global temperatures are now emerging for clusters out to the 
highest redshifts where they have been secured (e.g., |140j ). At the time of 
writing, several years after the advent of the new generation of X-ray tele- 
scopes, no determinations of the XTF from Chandra and XMM-Newton data 
have been presented, a situation that is expected to change quite soon. 



6.3 The luminosity function 

Another method to trace the evolution of the cluster number density is based 
on the X-ray luminosity function (XLF), <j){Lx), which is defined as the num- 
ber density of galaxy clusters having a given X-ray luminosity. Similarly to 
eq. lf4*5|l , the XLF can be related to the cosmological mass function of collapsed 
halos as 

<f>(L x ) =n[M(L x )}^, (46) 
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where M(Lx) provides the relation between the observable Lx and the cluster 
mass. The above relation needs to be suitably modified in case an intrinsic 
scatter exists in the relation between mass and temperature (see Section 
here below). 

A useful observational quantity, that is related to the XLF, is given by 
the flux number-counts, n(S), which is defined as the number of clusters per 
steradian, having measured flux S: 

(e.g., |94p where r(z) is the radial coordinate appearing in the Friedmann- 
Robertson-Walker metric: 

r{z)= I dzE- x (z) ■ n A = i-n m 

Jo 

, s 2[f? m z + (2-f? m )(i- yi + n mZ )] 
r(z) = n?jTTz~) 5 Qa = - (48) 



The flux S is related to the luminosity according to 



(49) 



where — r(z)(\ + z) is the luminosity distance at redshift z. 

This quantity can be measured for a flux-limited samples without having 
information on cluster redshift and provides useful cosmological information in 
the absence of any spectroscopic optical follow-up. A comparison between dif- 
ferent observational determinations of the flux number counts for both nearby 
and distant cluster samples (e.g., |138| 1 show indeed a quite good agreement. 

Another quantity, which has been used to derive cosmological constraints 
from flux-limited surveys, is the redshift distribution, n(z), which is defined 
as the number of clusters found in a survey at a given redshift z: 

( c \ 3 r 2 (z) r°° dM 
< Z )=\-H ) W) Js dS fs k y{S)n[M(S lZ )-z] — . (50) 

In the above expression Su rn is the limiting completeness flux of the survey, 
while f s ky (S) is the effective flux-dependent sky-coverage appropriate for the 
considered survey. Convolving the mass function with the sky coverage inside 
the integral in the above equation is essential to properly account for the dif- 
ferent effective area covered at different fluxes, an aspect which is apparently 
overlooked in some analyses (e.g., |158j ). 

In general, the advantage of using X-ray luminosity as a tracer of the 
mass is that Lx is measured for a much larger number of clusters in samples 
with well-defined selection properties. As discussed in Section l4~2l the most 
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recent flux-limited cluster samples contain now a fairly large (~ 100) number 
of objects, which are homogeneously identified over a broad redshift baseline, 
out to z ~ 1.3. This allows nearby and distant clusters to be compared within 
the same sample, i.e. with a single selection function. However, since the X- 
ray emissivity depends on the square of the gas density, the relation between 
Lx and M V i r , which is based on additional physical assumptions, is more 
uncertain than the M v - a —a v or the M v - lr -T relations. 




Fig. 14. A compilation of XLF within different redshift intervals for independent 
X-ray flux-limited surveys (from |119) L The shaded area shows the range of deter- 
minations of the local XLF, while the solid curve is the best-fitting evolving XLF 
from |H5| . 



A useful parametrization for the relation between temperature and bolo- 
metric luminosity can be casted in the form 

L " = L > (sHr)" (1 + «>* Got)" 1 » 44 "- 2 ^-' • <W 

with Lq defining the normalization of the relation and di,{z) the luminosity- 
distance at redshift z for a given cosmology. 

Analyses of the number counts from different X-ray flux-limited cluster 
surveys showed that the resulting constraints on Q m are rather sensitive to 
the evolution of the mass-luminosity relation |94l 11071 124") . On the other hand, 
other authors |141U135lll58| analysed different flux-limited surveys and found 
results consistent with Q m = 1 . Quite intriguingly, this conclusion is common 
to analyses which combine a normalization of the local mass function, using 
nearby clusters, and to the evolution of the mass function using deep surveys. 
Clearly, any uncertainty in the calibration of the selection functions when 
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combining different surveys may induce a spurious signal of evolution of the 
cluster population, possibly misinterpreted as an indication for high fi m . 

In order to overcome this potential problem, Borgani et al. (291 ana- 
lyzed the RDCS sample to trace the cluster evolution over the entire redshift 
range, 0.05J5 1.3, probed by this survey, without resorting to any exter- 
nal normalization from a different survey of nearby clusters [2Hj • They found 
0.1^5 fi m ~ 0.6 at the 3er confidence level, by allowing the M-Lx relation to 
change within both the observational and the theoretical uncertainties. In 
Figure IT51 we show the resulting constraints on the a%-fi m plane (from |138p 
and how they vary by changing the parameters defining the M-Lx relation: 
the slope a and the evolution A of the Lx~T relation (see Eauation l51|) . the 
normalization f} of the M-T relation (see Eauation l36(l , and the overall scatter 
Am-Lx- Flat geometry is assumed here, i.e. fi m + Qa = 1- 

Similar results have been obtained by combining information on clustering 
properties and the redshift distribution from the the REFLEX cluster survey 
|144| . thus providing a% — 0.7 and fi m ~ 0.35. One should however notice that, 
since these constraints are derived from nearby clusters, the corresponding 
estimate of fi m comes from the shape of the CDM power spectrum, rather than 
from the growth rate of perturbations. It is rather reassuring that dynamical 
and geometrical constraints on fi m are in fact consistent with each other. 

Constraints on fi m from the cluster X-ray luminosity and temperature dis- 
tribution are thus in line with the completely independent constraints derived 
from the baryon fraction in clusters, /bar (e.g., |lfi9l 1581 14*|). 

6.4 The gas mass function 

An alternative way of tracing the mass function of galaxy clusters is based 
on using as its proxy the mass function of the cluster gas content |163| . This 
method is based on the assumption that galaxy clusters are fair containers 
of cosmic baryons. Similarly to the method based on the baryon fraction, 
it relies on the knowledge of the cosmic baryon fraction, either provided by 
data on the deuterium abundance in high-redshift absorption systems (e.g., 
92 ) combined with predictions of primordial nucleosynthesis, or from the 
spectrum of CMB anisotropics (e.g., |I051 ll49] '). This method has the potential 
advantage that cluster gas mass is an easier quantity to measure than the total 
collapsed mass, since it is essentially related to the total cluster emissivity. 

If we define nb(Mb) to be the baryonic mass function and n(M) the total 
mass function, by definition we have rib(Mb) = n(fi m Mb/ fib)- Therefore, once 
nb(Mb) and fib are known from observations, the total mass function can be 
computed as a function of fi m , thereby treated as a fitting parameter. 

While this method has the remarkable advantage of avoiding the uncer- 
tainties related to direct estimates of the total collapsed mass, it is affected 
by possible violations of the assumption of universality of the baryon content 
of clusters. Indeed, while this assumption should be valid for suitably relaxed 
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Fig. 15. Probability contours in the a$-n m plane from the evolution of the X-ray 
luminosity distribution of RDCS clusters. The shape of the power spectrum is fixed 
to r = 0.2 Different panels refer to different ways of changing the relation 

between cluster virial mass, M, and X-ray luminosity, Lx, within theoretical and 
observational uncertainties (see also |29p. The upper left panel shows the analysis 
corresponding to the choice of a reference parameter set. In each panel, we indi- 
cate the parameters which are varied, with the dotted contours always showing the 
reference analysis. 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 a = 3 : 
- A=0 J 
4 0=1.15 - 


i 1 i i i 1 i i i 1 i i i 1 i i i 
L A=l j 

"^^^^ " 


i 1 i i i 1 i i i 1 i i i 1 i i i 
r a = 2.5j 


J 1 1 1 1 1 1 1 1 1 1 1 L 

-^^/3=1.5 - 
i 1 i i i 1 i i i 1 i i i 1 i i i 


j 1 1 1 1 1 1 1 1 1 1 1 1 f 

j\ /?= 1.15 - 
J\A M . k =20%r 

i 1 i i i 1 i i i 1 i i i 1 i i i 


j 1 1 1 1 1 1 1 1 1 1 1 1 1 r 

: = 0.8 : 
| A M _ k =20%" 

=% i 

i 1 i i i 1 i i i 1 i i i 1 i i i 



and massive clusters, it may be less so when considering all objects belong- 
ing to a flux-limited sample, thus including also relatively small clusters and 
structures with a complex non-relaxed dynamics. 

This method was applied |164j to a set of bright clusters selected from the 
RASS [Ha and found cr 8 = 0.72 ± 0.04 with Q m h = 0.13 ± 0.07. Chandra 
observations were included in the analysis for a set of clusters extracted from 
the 160 deg 2 survey They found an evolution of the gas mass function, 

which is consistent with a flat cosmological model with f2 m = 0.3. 

We emphasize here that the above different methods used to reconstruct 
the mass function of galaxy clusters consistently prefer relatively low values 
of as, in the range 0.7-0.8. Quite remarkably, such values have been shown 
now to be required by the 3-years WMAP data release ( .149 ). 
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6.5 Including uncertainties in the analysis 

As we have discussed in the previous sections, most of the analyses of the 
cluster populations converge toward a low-density model, with Q m ~ 0.3. 
However, significant differences exist between different determinations of the 
normalization of the power spectrum, as, which amount to up to ~ 20 per cent. 
These differences are much larger that the statistical uncertainties associated 
to the finite number of clusters included in the samples, thus indicating that 
they arise from unaccounted sources of error, which affects the analyses. 

For instance, the role of the uncertain normalization of the mass-temperature 
relation in the determination of a% (at fixed £2 m ) from the XTF analysis has 
been emphasized by different authors (e.g., [55111271 Il45| ). Increasing the nor- 
malization of the M-T relation implies that a larger mass corresponds to a 
fixed temperature value. As a consequence, an observed XTF translates into 
a larger mass function, therefore implying a larger as (at fixed f2 m ). Since 
results from hydrodynamical simulations generally imply a larger M-T nor- 
malization, a larger as is expected when using in the analysis the simulation 
predictions. The left panel of Figure Hoi (from shows how the best fit- 
ting values of a% and Q m from the local XTF change as one uses different 
mass-temperature relations, taken from both observations and simulations. 
The right panel of FigfTnl (from |127| ) show the dependence of as on the nor- 
malization of the mass-temperature relation. Note that this normalization is 
allowed here to vary over the range encompassed by observational and simu- 
lation results. The range of variation of as induced by the uncertainty in the 
M-T relation is at least comparable to the purely statistical uncertainty, as 
indicated by the errorbars. 

One may wonder why not relying only on the observational determination 
of the M-T relation, instead of considering also simulation results. As we shall 
discuss in Section[7| observational results can not necessarily provide the most 
reliable determination of the M-T scaling. 

The effect of changing the normalization of the M-T relation on the anal- 
ysis of flux-limited samples through the XLF evolution is shown in the lower 
left panel of FigE3 If this normalization is reduced by ~ 30%, the resulting 
as decreases by ~20%. 

It is clear that, any uncertainty, both statistical and systematic, in the 
fitting parameters describing the scaling relations between mass and observ- 
able, must be included in the analysis by marginalizing over the probability 
distribution function of these parameters. Let n be the set of cosmological 
parameters that we want to constrain, and W the set of parameters which 
define a scaling relation between mass M and an observable X (i.e., a v , Lx 
or T). Let us also call P(W) the prior distribution for the uncertainties in the 
M-X relation. If x 2 (^> W) gives the goodness of fit provided by the choice $1 
of the cosmological parameters, for a given M-X relation, then the goodness 
of fit after marginalizing over the uncertainties in M-X reads 
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Fig. 16. Left panel: the dependence of the best-fitting values of Q m and a$, from the 
XTF of |88|. upon different determinations of the mass-temperature relation, from 
both observational data and from hydrodynamical simulations of galaxy clusters. 
Right panel: the dependence of as on the normalization of the mass-temperature 
relation, in the XTF analysis by |127| . 
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Of course, the marginalization generally induces an increase of the uncertain- 
ties in the cosmological parameters. Furthermore, one needs to have a reliable 
modeling of both size and distribution of the errors (i.e. whether they have a 
uniform, a Gaussian, or a more peculiar distribution). 

Besides the errors in the parameters defining the scaling relations, a differ- 
ent source of uncertainty is provided by the intrinsic scatter in these relations. 
Intrinsic scatter has the effect of widening the range of possible masses which 
correspond to a given value of the observable quantity. This effect can be in- 
cluded in the analysis by convolving the theoretical mass function with the 
distribution of the scatter itself (e.g., 12911980 . Let \P be an observable quantity 
and (/>(&) its distribution (i.e., XLF or XTF), to be compared with observa- 
tions. Also, let P(M#\M;z) be the probability of assigning a mass My to a 
cluster of true mass M, at redshift z, from the observable IP, for a given M- 
& relation. Therefore, the model prediction for the distribution (j)^), to be 
compared with its observational determination, is given by the convolution of 
the cosmological mass function with the distribution of the intrinsic scatter: 

6(&)d& = [ dMvn(Mv,z)P(Mv\M;z)^d& (53) 
7 air 
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where n{M, z) is the cosmological mass function at redshift z. If one makes 
the standard assumption of Gaussian scatter in the log-log plane, then 

P(M 9 \M) = (2ira^ M )~ l/2 exp[-a^(M 9 )] , (54) 

where x(M&) = (In Ms? — ]xiM)/{-\/2a\ n M) and o\ u m is the r.m.s. intrinsic 
scatter. The effect of this convolution is that of increasing </)(&), for a fixed 
n(M), as the scatter increases. Therefore, assuming a progressively larger scat- 
ter in the M-<P relation implies a progressively lower eg (at fixed f2 m )- An 
illustrative example of the effect of intrinsic scatter on the determination of 
a 8 is reported in Figure El In the left panel we show the REFLEX XLF 
along with the prediction of the best fitting cosmological model for a given 
choice of the M-Lx relation, after assuming vanishing intrinsic scatter in this 
relation. In the left panel, we show the same comparison, but assuming an 
Gaussian-distributed intrinsic scatter of 40 per cent in the M-Lx scaling. 
As expected, adding the scatter has the effect of increasing the predicted lu- 
minosity function, so that as has to be lowered from 0.8 to 0.65 to recover 
the agreement with observations. This example highlights that a good cali- 
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Fig. 17. The dependence of as on the intrinsic scatter assumed in the relation 
between X-ray luminosity and mass. The two panels show the comparison between 
model predictions and the observed XLF from the REFLEX sample [20] . The left 
panel assumes vanishing scatter while the right panel assumes a 40 per cent intrinsic 
scatter. The best fitting values of ag are reported in both cases. 



bration of the intrinsic scatter in the scaling relations can be as important as 
determining the best-fitting amplitude and slope of these relations. 
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7 The future 

A new era for cosmology with galaxy clusters is now starting. High sensitivity 
surveys for blind SZ identification over fairly large contiguous area, ~ 100 
deg 2 , have have already started or are planned in the coming years (see the 
lectures by M. Birkinshaw in this volume). Also, the Planck satellite will 
survey the whole sky, although at a much lower sensitivity, and provide a 
large set of clusters identified through the SZ effect. These surveys promise 
to identify several thousands clusters, with a fair number of objects expected 
to be found at z > 1. In the optical/near-IR bands, imaging with dedicated 
telescopes with large field of view will also allow to secure a large number of 
distant clusters. At the same time, X-ray observations over contiguous area 
(e.g., |17()j and "serendipitous" searches from XMM-Newton (e.g., |118j ) and 
Chandra (e.g., |3U]) archives will ultimately cover several hundreds deg 2 down 
to flux limits fainter than those reached by the deepest ROSAT pointings. 
Preliminary results suggest that identification of z > 1 clusters may eventually 
become routine |118| . Ultimately, they will lead to the identification of several 
thousands clusters. 

Optimized optics for wide-field X-ray imaging have been originally de- 
scribed in a far-reaching paper by Burrows et al. and proposed for the 
first time to be implemented in a dedicated satellite mission in the mid 90's. 
There is no doubt that this would be the right time to plan a dedicated 
wide-field X-ray telescope, which should survey the sky over an area of sev- 
eral thousands deg 2 , with a relatively good, XMM-like or better, point spread 
function and a low background. This instrument would be invaluable for stud- 
ies of galaxy clusters, thanks to its ability of both identifying extended sources 
with low surface brightness. Several missions with a similar profile have been 
proposed, although none has been approved so far. Still, the community work- 
ing on galaxy clusters is regularly proposing this idea of satellite to different 
Space Agencies (e.g., [79]b 

The samples of galaxy clusters obtainable from large-area SZ and X-ray 
surveys contain in principle so much information to allow one to constrain not 
only J? TO and as, but also the Dark Energy (DE) content of the Universe (see, 
e.g., |15ril 1142] for introductory reviews on Dark Energy). The equation of 
state of DE is written in the form p = wp, where p and p are the pressure and 
density terms, respectively. The parameter w must take values in the range 
— 1/3 > w > —1 for the DE to provide an accelerated cosmic expansion. 
Constraining the value of w and its redshift evolution is currently considered 
one of the most ambitious targets of modern cosmology. Ones hope is to unveil 
the nature of the energy term which dominates the overall dynamics of the 
Universe at the present time. 

As we have mentioned, the limited statistics prevents current cluster sur- 
veys to place significant constraints on Q\. Although this limitation will be 
overcome with future cluster surveys, the question remains as to whether the 
systematic effects, discussed in Section 16.51 can be sufficiently understood. 
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Different lines of attack have been proposed in the literature, which should 
not considered as alternative to each other. 
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Fig. 18. Constraints on cosmological parameters from the SZ SPT survey, from the 
SZ Planck survey and from a wide-area deep X-ray survey (from top to bottom 
panels; from |lU5p . Dotted lines: no uncertainty in the relation between cluster 
mass and observables, using only the dn/dz information; long-dashed line: using 
self-calibration on the dn/dz; dot-dashed line: as before, but also including the 
information on the cluster power spectrum; short-dashed line: using dn/dz and 
a calibration of the mass-observable relation for 100 clusters; solid line: all the 
information combined together. 



Majumdar and Mohr |105| have proposed the approach based on the so- 
called self-calibration (see also |1041 1971 198p . The idea underlying this ap- 
proach is that of parametrizing in a sensible way the scaling relations between 
cluster observables and mass, including the corresponding intrinsic scatter and 
its distribution. In this way, the parameters describing these relations can be 
considered as fitting parameter to be added to the cosmological parameters. 
As long as the cluster samples are large enough, one should be able to fit at the 
same time both cosmological parameters and those parameters related to the 
physical properties of clusters. Figure ^| (from 105 ) shows the constraints 
that one can place on the w-£2 m (left panel) and on the ag,-f2 m planes, after 
marginalizing over the other fitting parameters, from different SZ and X-ray 
surveys. In each panel different contours indicate the constraints that one can 
place by progressively adding information in the analysis. The main message 
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here is that combining information on the evolution of the cluster population 
and on its clustering can place precision constraints on cosmological parame- 
ters. These constraints can be further tightened if follow up observations are 
available to precisely measure masses for 100 clusters. 

These forecasts nicely illustrates the potentiality of the self-calibration ap- 
proach for precision cosmology with future surveys of galaxy clusters. Clearly, 
the robustness of these predictions is inextricably linked to the possibility of 
accurately modeling the relations between mass and observables. 

A line of attack to this problem is based on using detailed hydrodynamical 
simulations of galaxy clusters. The great advantages of using simulations is 
that both cluster mass and observable quantities can be exactly computed. 
Furthermore, the effect of observational set-ups (e.g., response functions of 
detectors, etc.) can be included in the analysis and their effect on the scaling 
relations quantified. This approach has been applied by different groups in the 
case of X-ray observations [M] to understand the relation between the ICM 
temperature, as measured from the fitting of the observed spectrum, and 
the "true" mass-weighted temperature (e.g., [1081 IllOl llfiOj ). Furthermore, 
simulations can also be used to verify in detail the validity of assumptions on 
which the mass estimators, applied to observations, are based. The typical ex- 
ample is represented by the assumption of hydrostatic equilibrium, discussed 
in Sect l5.ll for which violations in simulations at the 10-20 per cent level have 
been found (e.g., [T2I IT351 1271 IST] 1 ■ 

An example of calibration of observational biases in the mass-temperature 
relation, using hydrodynamical simulations, is shown in FigurelTWfrom 133 ), 
which provides a comparison between the observed and the simulated M— 
T relation. Simulations here include radiative cooling, star formation and 
the effect of galactic winds powered by supernovae, and, as such, provide 
a realistic description of the relevant physical processes. The observational 
results, which are taken from corresponds to mass estimates based on 
the hydrostatic equilibrium for a polytropic /3-model of the gas distribution 
[eq. <E3J>] • I n both panels, the temperature has been computed by using a 
proxy to the actual spectroscopic temperature, the so-called spectroscopic- 
like temperature jllOj . The left panel shows the results when exact masses 
of simulated clusters are used for the comparison. Based on this result only, 
the conclusion would be that simulations do indeed produce too high a M— 
T relation, even in the presence of a realistic description of gas physics. In 
the left panel, masses of simulated clusters are computed instead by using 
the same procedure as for observed clusters, i.e. by applying ea . 1)34(1 for the 
hydrostatic equilibrium of a polytropic /3-model. Quite remarkably, the effect 
of applying the observational mass estimator has two effects. First, the overall 
normalization of the M-T relation is decreased by the amount required to 
attain a reasonable agreement with observations. Second, the scatter in the 
simulated M-T relation is substantially suppressed. This is the consequence of 
the fact that eq. ((34(1 provides a one-to-one correspondence between mass and 
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Fig. 19. The mass-temperature relation at p/ per = 500, in simulations (filled circles 
and triangles) and for the observational data (squares with errorbars, |62|~). The left 
panel is for the true masses of simulated clusters; the right panel is for masses of 
simulated clusters estimated by adopting the same procedure applied by Finoguenov 
et al. to observational data (from |133p . 



temperature, while only the cluster-by-cluster variations of (3 and 7 account 
for the intrinsic diversity of the cluster thermal structure. 

This example illustrates how simulations can be usefully employed as 
guidelines to study possible biases on observational mass estimates. However, 
it is worth reminding that the reliability of simulation results depends on 
our capability to correctly provide a numerical description of all the relevant 
physical process. In this sense, understanding in detail the (astro)physics of 
clusters is mandatory in order to calibrate them as tools in the era of precision 
cosmology. 

As a concluding remark, we emphasize once more that a number of inde- 
pendent analysesof the cluster mass function, which have been realized so far, 
favor a relatively low normalization of the power spectrum, with as — 0.7-0.8 
for fi m ~ 0.3, thus in agreement with the most recent WMAP results ( |149p . 
This agreement must be considered as a success for cluster cosmology and 
a strong encourgement for future applications to large cluster surveys of the 
next generation. 
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